The Reinforcement Learning Competitions

نویسندگان

  • Shimon Whiteson
  • Brian Tanner
  • Adam White
چکیده

Events such as RoboCup, the Netflix Prize, the Trading Agent Competition, the International Planning Competition, the General Game Playing Competition, the AAAI Computer Poker Competition, and the DARPA Grand Challenge have succeeded in raising awareness and stimulating research about their respective topics. The empirical, problem-oriented nature of these competitions can be an important counterweight to traditional research efforts, which often focus more on theoretical results and algorithmic innovation. Competition results provide a barometer for which approaches are popular, effective, and scalable to challenging problems. The competitive nature of the events provides an incentive to transform theoretical ideas into practical tools, enabling the field to reap the benefits of its scientific progress. The field of reinforcement learning (RL) (Kaelbling, Littman, and Moore 1996; Sutton and Barto 1998), is ripe for such a transformation. Broadly speaking, RL researchers aim to develop online algorithms for optimizing behavior in sequential decision problems (SDPs), wherein agents interact with typically unknown environments and seek behavior that maximizes their long-term reward. Many challenging and realistic domains can be cast in this framework (for example, robot control, game playing, and system optimization), so RL algorithms contribute to the broad goals of artificial intelligence. In recent years, many advances have been made in RL theory and algorithms, particularly in areas such as balancing exploration and exploitation, Articles

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Unsupervised Learning Method for an Attacker Agent in Robot Soccer Competitions Based on the Kohonen Neural Network

RoboCup competition as a great test-bed, has turned to a worldwide popular domains in recent years. The main object of such competitions is to deal with complex behavior of systems whichconsist of multiple autonomous agents. The rich experience of human soccer player can be used as a valuable reference for a robot soccer player. However, because of the differences between real and simulated soc...

متن کامل

Reinforcement learning for robot soccer

Batch reinforcement learning methods provide a powerful framework for learning efficiently and effectively in autonomous robots. The paper reviews some recent work of the authors aiming at the successful application of reinforcement learning in a challenging and complex domain. It discusses several variants of the general batch learning framework, particularly tailored to the use of multilayer ...

متن کامل

RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments

RL-Glue is a standard, language-independent software package for reinforcement-learning experiments. The standardization provided by RL-Glue facilitates code sharing and collaboration. Code sharing reduces the need to re-engineer tasks and experimental apparatus, both common barriers to comparatively evaluating new ideas in the context of the literature. Our software features a minimalist inter...

متن کامل

Modular Learning Systems for Soccer Robot

This paper presents a series of the studies of modular learning system for vision-based behavior acquisition of a soccer robot participating in middle size league of RoboCup (Asada, et al. 1999). Reinforcement learning has recently been receiving increased attention as a method for behavior learning with little or no a priori knowledge and higher capability of reactive and adaptive behaviors. H...

متن کامل

A hybrid-learning based broker model for strategic power trading in smart grid markets

Smart Grid markets are dynamic and complex, and brokers are widely introduced to better manage the markets. However, brokers face great challenges, including the varying energy demands of consumers, the changing prices in the markets, and the competitions between each other. This paper proposes an intelligent broker model based on hybrid learning (including unsupervised, supervised and reinforc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010